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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Administrators of Tulane Educational Fund 
Philipp, Mario T. 

(ii) TITLE OF INVENTION: Surface Antigens and Proteins Useful in 
Compositions for the Diagnosis and Prevention of Lyn>e 
Disease 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr.r P.O. Box 457 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

. (vj COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS-DOS 

(D) SOFTWARE: Patentin Release «1.0, Version 1^1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/051,271 

(B) FILING DATE: 30-JUN-1997 

(viii) ATTORJ^EY/AGENT INFORMATION: 

(A) NAME: Bak, Mary E. 

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE/DOCKET NUMBER: TUL2APCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-54 0--9200 

(B) TELEFAX: 215-540-5818 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(D) LOCATION: 1..1047 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AAG AAT AAT GAT CAT GAT AAT CAT AAG GGG ACT GTT AAG AAT GCT GTT 4 8 

Lys Asn Asn Asp His Asp Asn His Lys Gly Thr Val Lys Asn Ala Val 

1 5 10 15 

GAT ATG GCA AAG GCC GCT GAG GAA GCT GCA AGT GCT GCA AGT GCT GCT 96 

Asp Met Ala Lys Ala Ala Glu Glu Ala Ala Ser Ala Ala Ser Ala Ala 

20 25 30 

ACT GGT AAT GCA GCG ATT GGG GAT GTT GTT AAG AAT AGT GGG GCA GCA 14 4 

Thr Gly Asn Ala Ala lie Gly Asp Val Val Lys Asn Ser Gly Ala Ala 

35 40 45 

GCA AAA GGT GGT GAG GCG GCG AGT GTT AAT GGG ATT GCT AAG GGG ATA 192 

Ala Lys Gly Gly Glu Ala TVla Ser Val Asn Gly lie Ala Lys Gly He 

50 55 60 

AAG GGG ATT GTT GAT GCT GCT GGA AAG GCT GAT GCG AAG GAA GGG AAG 24 0 

Lys Gly He Val Asp Ala Ala Gly Lys Ala Asp Ala Lys Glu Gly Lys 

65 70 75 80 

TTG GAT GCT ACT GGT GCT GAG GGT ACG ACT AAC GTG AAT GCT GGG AAG 20 8 

Leu Asp Ala Thr Gly Ala Glu Gly Thr Thr Asn Val Asn Ala Gly Lys 

85 90 95 

TTG TTT GTG AAG AGG GCG GCT GAT GAT GGT GGT GAT GCA GAT GAT GCT 336 

Leu Phe Val Lys Arg Ala Ala Asp Asp Gly Gly Asp Ala Asp Asp Ala 

100 105 110 

GGG AAG GCT GCT GCT GCG GTT GCT GCA AGT GCT GCT ACT GGT AAT GCA 384 

Gly Lys Ala Ala Ala Ala Val Ala Ala Ser Ala Ala Thr Gly Asn Ala 

115 120 125 

GCG ATT GGA GAT GTT GTT AAT GGT GAT GTG GCA AAA GCA AAA GGT GGT 432 

Ala He Gly Asp Val Val Asn Gly Asp Val Ala Lys Ala Lys Gly Gly 

130 135 140 

GAT GCG GCG AGT GTT 7KAT GGG ATT GCT AAG GGT ATA AAG GGG ATT GTT 4 80 

Asp Ala Ala Ser Val Asn Gly He Ala Lys Gly He Lys Gly He Val 

145 150 155 160 

GAT GCT GCT GAG AAG GCT GAT GCG AAG GAA GGG AAG TTG AAT GCT GCT 528 

Asp Ala Ala Glu Lys Ala Asp Ala Lys Glu Gly Lys Leu Asn Ala Ala 

165 170 175 

GGT GCT GAG GGT ACG ACT AAC GCG GAT GCT GGG AAG TTG TTT GTG AAG 576 

Gly Ala Glu Gly Thr Thr Asn Ala Asp Ala Gly Lys Leu Phe Val Lys 

180 185 190 

AAT GCT GGT AAT GTG GGT GGT GAA GCA GGT GAT GCT GGG AAG GCT GCT 62 4 
Asn Ala Gly Asn Val Gly Gly Glu Ala Gly Asp Ala Gly Lys Ala Ala 

195 200 205 
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GCT GCG GTT GCT GCT GTT AGT GGG GAG GAG ATA TTA AAA GCG ATT GTT 672 
Ala Ala Val Ala Ala Val Ser Gly Glu Gin lie Leu Lys Ala lie Val 
210 215 220 

.CAT GCT GCT AAG GAT GGT GGT GAG AAG CAG GGT AAG AAG GCT GCG GAT 720 
His Ala Ala Lys Asp Gly Gly Glu Lys Gin Gly Lys Lys Ala Ala Asp 
225 230 235 240 

CGT ACA AAT CCC ATT GAC GCG GCT ATT GGG GGT GCG GGT GAT 7VAT GAT 7 60 

Arg Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ala Gly Asp Asn Asp 
245 250 255 

GCT GCT GCG GCG TTT GCT ACT ATG AAG AAG GAT GAT CAG ATT GCT GCT 816 
Ala Ala Ala Ala Phe Ala Thr Met Lys Lys Asp Asp Gin lie Ala Ala 

260 265 270 

GCT ATG GTT CTG AGG GGA ATG GCT AAG GAT GGG CAA TTT GCT TTG AAG 664 
Ala Met Val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu Lys 
275 280 285 

GAT GCT GCT GCT GCT CAT GAA GGG ACT GTT AAG AAT GCT GTT GAT ATA 912 
Asp Ala Ala Ala Ala His Glu Gly Thr Val Lys Asn Ala Val Asp He 
290 295 300 

ATA AAG GCT GCT GCG GAA GCT GCA AGT GCT GCA AGT GCT GCT ACT GGT 960 
He Lys Ala Ala Ala Glu Ala Ala Ser Ala Ala Ser Ala Ala Thr Gly 
305 310 315 320 

AGT GCA GCA ATT GGG GAT GTT GTT AAT GGT AAT GGA GCA ACA GCA AAA 1008 
Ser Ala Ala He Gly Asp Val Val Asn Gly Asn Gly Ala Thr Ala Lys 
325 330 335 

GGT GGT GAT GCG AAG AGT GTT AAT GGC ATT GCT AAG GGA 1047 
Gly Gly Asp Ala Lys Ser Val Asn Gly He Ala Lys Gly 
340 345 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Lys Asn Asn Asp His Asp Asn His Lys Gly Thr Val Lys Asn Ala Val 
15 10 15 

Asp Met Ala Lys Ala Ala Glu Glu Ala Ala Ser Ala Ala Ser Ala Ala 
20 25 30 



Thr Gly Asn Ala Ala He Gly Asp Val Val Lys Asn Ser Gly Ala Ala 
35 40 45 
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Ala Lys Gly Gly Glu Ala Ala Ser Val Asn Gly lie Ala Lys Gly lie 
50 55 60 

Lys Gly lie Val Asp Ala Ala Gly Lys Ala Asp Ala Lys Glu Gly Lys 
€5 70 75 80 

Leu Asp Ala Thr Gly Ala Glu Gly Thr Thr Asn Val Asn Ala Gly Lys 
85 90 95 

Leu Phe Val Lys Arg Ala Ala Asp Asp Gly Gly Asp Ala Asp Asp Ala 
100 105 110 

Gly Lys Ala Ala Ala TVla Val Ala Ala Ser Ala Ala Thr Gly Asn Ala 
115 120 125 

Ala lie Gly Asp Val Val Asn Gly Asp Val Ala Lys Ala Lys Gly Gly 
130 135 140 

Asp Ala Ala Ser Val Asn Gly lie Ala Lys Gly lie Lys Gly lie Val 
145 150 155 160 

Asp Ala Ala Glu Lys Ala Asp Ala Lys Glu Gly Lys Leu Asn Ala Ala 
165 170 175 

Gly Ala Glu Gly Thr Thr Asn Ala Asp Ala Gly Lys Leu Phe Val Lys 
ISO 185 190 

Asn Ala Gly Asn Val Gly Gly Glu Ala Gly Asp Ala Gly Lys Ala Ala 
195 200 205 

Ala Ala Val Ala Ala Val Ser Gly Glu Gin lie Leu Lys Ala lie Val 
210 215 220 

His Ala Ala Lys Asp Gly Gly Glu Lys Gin Gly Lys Lys Ala Ala Asp 
225 230 235 240 

Arg Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ala Gly Asp Asn Asp 
245 250 255 

Ala Ala Ala Ala Phe Ala Thr Met Lys Lys Asp Asp Gin lie Ala Ala 
260 265 270 

Ala Met Val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu Lys 
275 280 285 

Asp Ala Ala Ala Ala His Glu Gly Thr Val Lys Asn Ala Val Asp lie 
290 295 300 

lie Lys Ala Ala Ala Glu Ala Ala Ser Ala Ala Ser Ala Ala Thr Gly 
305 310 315 320 

Ser Ala Ala lie Gly Asp Val Val Asn Gly Asn Gly Ala Thr Ala Lys 
325 330 335 

Gly Gly Asp Ala Lys Ser Val Asn Gly He Ala Lys Gly 
340 345 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCCGCTGGAT GGTGGTGAGA AGCAGGGTAA GAAGGCTGCG GATCGTACAA ATCCCATTGA 
CGCGGCTATT GGGGGTGCGG GTGATAATGA TGCTGCTGCG GCGTTTGCTA CTATGAAGAA 
GGATGATCAG ATTGCTGCTG CTATGGTTCT GAGGGGAATG GCTAAGGATG GGCAATTTGC 
TTTGAAGGAT GCTGCTGCTG CTCATGAAGG GACTGTTAAG AATGCTGTTG ATATAATAAA 
GGCTGCTGCG GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGT 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTTATTATAT CAACAGATTC TTAACAGTCC CTTCATGAGC AGCAGCAGCA TCCTTCAAAG 
CAAATTGCCC ATCCTTAGCC ATTCCCCTCA GAACCATAGC AGCAGCAATC TGATCATCCT 
TCTTCATAGT AGCAAACGCC GCAGCAGCAT CATTATCACC CGCACCCCCA ATAGCCGCGT 
CTUVTCGGATT TGTACGATCC GCAGCCTTCT TACCCTGCTT CTCACCACCA TCC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 



3 



6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CCGTGCAAGC TGGGTTGAAG AAGGTTGGGG ATGTTGTTAA GAATAGTGAG GCAAAAGATG 



60 



GTGATGCGGC GAGTGTTAAT GGGATTGCTA AGGGGATAAA GGGGATTGTT GATGCTGCTG 



120 



AGAAGGCTGA TGCGAAGGAA GGGAAGTTGG TATGTGGCTG GTGCTGCTGG TGAAACTAAC 



180 



AAGGAAGCGG CCGC 



194 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA {genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

GCGGCCGCTT GAGGAAGCTG CAAGTGCTGC AAGTGCTGCT ACTGGTAATG CAGCGATTGG 60 

GGATGTTGTT AAGAATAGTG GGGCAGCAGC AAAAGGTGGT GAGGCGGCGA GTGTTAATGG 12 0 

GATTGCTAAG GGGATAAAGG GGATTGTTGA TGCTGCTGGA AAGGCTGATG CGAAGGAAGG 180 

GAAGTTGGAT GCTACTGGTG CTGAGGGTAC GACTAACGTG AATGCTGGGA AGTTGTTTGT 24 0 

GAAGAGGGCG GCTGATGATG GTGGTGATGC AGATGATGCT GGGAAGGCTG CTGCTGCGGT 300 

TGCTGCAAGT GCTGCTACTG GTAATGCAGC GATTGGAGAT GTTGTTAATG GTGATGTGGC 360 

ATVAACAT^AA 369 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUE14CE CHARACTERISTICS; 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 



T^lAGGATGCTG atgataagca gggtaagaag gctgaggatg ctacaaatcc gattgacgcg 



GCTATTGGGG GTGCAGGTGC GGGTGCTAAT GCTGCTGCGG CGTTTAATAA TATGAAGAAG 



GATGATCAGA TTGAGCGGCC GC 



60 
120 
142 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 
(Cj STRANDEDNESS: double 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GGTGAAACTA ACAAGGATGC TGGGAAGTTG TTTGTGAAGA AGAATGGTGA TGATGGTGGT 60 

GATGCAGGTG ATGCTGGGAA GGCTGCTGCT GCGGTTGCTG CTGTTAGTGG GGAGCAGATA 120 

TTAAAAGCGA TTGTTGATGC TGCTAAAGAT GGTGATAAGA CGGGGGTTAC TGATGTAAAG 180 

GATGCTACAA ATCCGATTGA CGCGGCTATT 210 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



TATATAATAA AGGCTGCTGC GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGTAGTGCAG 60 

CAATTGGGGA TGTTGTTAAT GGTAATGGAG CAACAGCAAA AGGTGGTGAT GCGAAGTGTT 120 

AATGGGATTG CTAAGGGGAT AAAGGGGATT GTTGATGCTG CTGAGAAGGC TGATGCGAAG 180 

GAAGGGAAGT TGGATGTGGC TGGTGATGCT GGTGAAACTA ACAAGGAAGC GGCCGC 236 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGAGAGGAT CTCATCACCA TCACCATCAC ACGGATCCCC CGGGCTGCAG GAATTCGCGG 60 
CCGCTGAAGG CTGATGCGAA GGAAGGGAAG TTGGATGTGG CTGGTGCTGC TGGTGAAACT 120 
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AACAAGGATG CTGGGAAGTT GTTTGTGAAG AAGAATAATG AGGGTGGTGA AGCAAATGAT 18 0 

GCTGGGAAGG CTGCTGCTG 199 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GCCGCTGGAT GATCAGATTG CTGCTGCTAT GGTTGTGAGG GGAATGGCTA AGGATGGGCA 60 

GTTTGCTTTG AAGGATGATG CTGCTAAGGA TGGAGATAAA ACGGGGGTTG CTGCGGATGT 12 0 

GAAAATCCGA TTGACGCGGC TATTGGGGGT GCGGATGCTG ATGCTGCGGC GTTTAATAAG 180 

GAGGGGATGA AGAAGGATGA TCAGATTGCT GCTGCTATGG TTCTGAGGGG AATGGCTAAG 24 0 

GATGGGCAGT TTGCTTTGAC GAATAATGCT GC 272 



(2) INFORl^Js^TION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 base pairs 

(B) TYPE: nucleic acid 

(C) STIU^DEDUESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ACTGTTAAGA ATGCTGTTGA TATAATAAAG GCTGCTGCGG AAGCTGCAAG TGCTGCAAGT 60 

GCTGCTACTG GTAGTGCAGC AATTGGGGAT GTTGTTAATG GTAATGGAGC AACAGCAAAA 120 

GGTGGTGATG CGAAGAGTGT TAATGGGATT GCTAAGGGGA TAAAGGGGAT TGTTGATGCT 180 

GCTGAGAAGG CTGATGCGAA GGAAGGGAAG TTGGATGTGG CTGGTGATGC TGGTGAAACT 24 0 

AACAAGGATG CTGGGAAGTT GTTTGTGAAG AACAATGGTA ATGAGGGTA 289 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2 ,.142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

G CCG CTT ACA AAT CCG ATT GAC GCG GCT ATT GGG GGG AGT GCG GAT 4 6 

Pro Leu Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ser Ala Asp 
15 10 15 

CGT AAT GCT GAG GCG TTT GAT AAG ATG AAG AAG GAT GAT CAG ATT GCT 94 
Arg Asn Ala Glu Ala Phe Asp Lys Met Lys Lys Asp Asp Gin lie Ala 

20 25 30 

GCT GCT ATG GTT CTG AGG GGA ATG GCT AAG GAT GGG CAG TTT GCT TTG 142 
Ala Ala Met Val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Leu Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ser Ala Asp Arg 
15 10 15 

Asn Ala Glu Ala Phe Asp Lys Met Lys Lys Asp Asp Gin lie Ala Ala 
20 25 30 

Ala Met Val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu 
35 40 45 



